Chapter 10: Introduction to Scientific Data Mining: Direct Kernel Methods & Applications

Authors

  • Mark J. Embrechts
  • Boleslaw Szymanski
  • Karsten Sternickel
Abstract

The purpose of this chapter is to give a brief overview of data mining and to introduce direct kernel methods as a powerful, general-purpose data mining tool for predictive modeling, feature selection and visualization. Direct kernel methods are a general methodology for converting linear modeling tools into nonlinear regression models by applying the kernel transformation as a data pre-processing step. We illustrate direct kernel methods for ridge regression and the self-organizing map, and apply these methods to some challenging scientific data mining problems. Direct kernel methods are introduced in this chapter because they transfer the nonlinear modeling power of support vector machines in a straightforward manner to more traditional regression and classification algorithms. An additional advantage of direct kernel methods is that only linear algebra is required.
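As a rough illustration of the idea in the abstract, the sketch below applies a kernel transformation as a pre-processing step and then runs ordinary linear ridge regression on the transformed data. It is a minimal sketch under stated assumptions, not the chapter's exact formulation: the Gaussian kernel, the function names (`gaussian_kernel`, `fit_direct_kernel_ridge`, `predict`), and the values of `sigma` and `ridge` are all chosen here for illustration, and refinements such as kernel centering are omitted.

```python
import numpy as np


def gaussian_kernel(A, B, sigma):
    """Return K with K[i, j] = exp(-||A[i] - B[j]||^2 / (2 * sigma^2))."""
    sq_dist = (
        (A ** 2).sum(axis=1)[:, None]
        + (B ** 2).sum(axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-np.maximum(sq_dist, 0.0) / (2.0 * sigma ** 2))


def fit_direct_kernel_ridge(X, y, sigma=1.0, ridge=1e-6):
    """Kernel-transform X, then solve plain linear ridge regression on K.

    The kernel matrix K is treated as the new data matrix, so the weights
    solve the usual ridge normal equations (K^T K + ridge * I) w = K^T y.
    Only linear algebra is involved. (Illustrative assumption, see lead-in.)
    """
    K = gaussian_kernel(X, X, sigma)
    n = K.shape[0]
    return np.linalg.solve(K.T @ K + ridge * np.eye(n), K.T @ y)


def predict(X_train, w, X_new, sigma=1.0):
    """Apply the same kernel transformation to new data, then the linear model."""
    return gaussian_kernel(X_new, X_train, sigma) @ w


# Toy nonlinear problem (made-up data): learn y = sin(x) on [0, 2*pi].
X_train = np.linspace(0.0, 2.0 * np.pi, 40).reshape(-1, 1)
y_train = np.sin(X_train).ravel()
w = fit_direct_kernel_ridge(X_train, y_train, sigma=1.0, ridge=1e-6)

# Evaluate at the midpoints between training inputs.
X_test = (X_train[:-1] + X_train[1:]) / 2.0
y_pred = predict(X_train, w, X_test, sigma=1.0)
max_abs_error = float(np.max(np.abs(y_pred - np.sin(X_test).ravel())))
```

The linear solver never sees the raw inputs, only the kernel-transformed matrix, which is what lets an otherwise linear ridge model capture the nonlinear target in this toy case.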


Similar resources

Applications of Kernel Methods

In this chapter, we give a survey of applications of the kernel methods introduced in the previous chapter. We focus on different application domains that are particularly active in both direct application of well-known kernel methods, and in new algorithmic developments suited to a particular problem. In particular, we consider the following application fields: biomedical engineering (comprisi...


Graphical models - methods for data analysis and mining

The best ebooks about Graphical Models Methods For Data Analysis And Mining that you can get for free here by downloading Graphical Models Methods For Data Analysis And Mining and saving it to your desktop. This ebook is under topics such as data mining with graphical models pdfsmanticscholar data mining with graphical models borgelt data mining with graphical models springer data mining with poss...


Support vector machines for classification: a statistical portrait.

The support vector machine is a supervised learning technique for classification increasingly used in many applications of data mining, engineering, and bioinformatics. This chapter aims to provide an introduction to the method, covering from the basic concept of the optimal separating hyperplane to its nonlinear generalization through kernels. A general framework of kernel methods that encompa...


Towards XML Mining: The Role of Kernel Methods

XML mining is a unique application of data mining, in that it deals with structured XML content. The introductory paper provides a brief but comprehensive review of milestones towards XML mining. XML mining did not arise overnight by chance; it is the accumulated result of a continuous evolution from data mining through text mining and web mining. Furthermore, the paper envisages the applic...


An Introduction to Uncertain Data Algorithms and Applications

In recent years, uncertain data has become ubiquitous because of new technologies for collecting data which can only measure and collect the data in an imprecise way. Furthermore, many technologies such as privacy-preserving data mining create data which is inherently uncertain in nature. As a result there is a need for tools and techniques for mining and managing uncertain data. This chapter d...




Publication date: 2003